Maintaining large update batches by restructuring and grouping
Authors
Abstract
Materialized views defined over distributed data sources can be used by many applications to ensure better access, reliable performance, and high availability. Technology for maintaining materialized views is thus critical for providing up-to-date results, since a stale view extent may fail to help these applications or may even mislead them. State-of-the-art incremental view maintenance requires O(n) or more remote maintenance queries, where n is the number of data sources in the view definition. In this work, we propose two novel maintenance strategies, namely adjacent grouping and conditional grouping, that dramatically reduce the number of maintenance queries required to maintain the materialized views. This reduction exposes a basic trade-off between the complexity of each query and the total number of maintenance queries, which can be exploited to improve maintenance performance. The proposed maintenance strategies have been implemented in a working prototype system called TxnWrap. Experimental studies illustrate that the proposed strategies achieve about a 400% performance improvement in total processing time compared with existing batch algorithms in a majority of cases. © 2006 Elsevier B.V. All rights reserved.
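The trade-off named in the abstract, fewer maintenance queries against more complex ones, can be illustrated with a toy cost model. The sketch below is not the paper's adjacent or conditional grouping algorithm. The function names, the cost formula, and the constants `overhead`, `unit`, and `exponent` are all hypothetical and chosen only to make the arithmetic visible: each remote query pays a fixed overhead plus a cost that grows with the number of sources it covers.

```python
def group_sizes(n_sources: int, group_size: int) -> list[int]:
    """Split n_sources into consecutive groups of at most group_size sources."""
    if n_sources < 1 or group_size < 1:
        raise ValueError("n_sources and group_size must be positive")
    full, remainder = divmod(n_sources, group_size)
    # 'full' groups of group_size, plus one smaller trailing group if needed.
    return [group_size] * full + ([remainder] if remainder else [])


def total_cost(
    n_sources: int,
    group_size: int,
    overhead: float = 1.0,   # hypothetical fixed cost per remote query
    unit: float = 0.1,       # hypothetical cost scale per covered source
    exponent: float = 2.0,   # hypothetical growth of query complexity
) -> float:
    """Toy cost: one query per group, each costing overhead + unit * size**exponent."""
    return sum(overhead + unit * size ** exponent
               for size in group_sizes(n_sources, group_size))


if __name__ == "__main__":
    n = 6
    for g in (1, 3, 6):
        sizes = group_sizes(n, g)
        print(f"group_size={g}: {len(sizes)} queries, cost={total_cost(n, g):.2f}")
```

Under these arbitrary constants, one query per source pays the fixed overhead six times, a single query over all six sources pays a large complexity cost, and an intermediate group size is cheaper than both. The real balance point depends on measured costs, which this sketch does not model.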
Similar resources
Restructuring View Maintenance Plans for Large Update
Materialized views defined over distributed data sources are a well-recognized technology in data integration, e-business, and the semantic web. Due to the constantly increasing size of the information sources and their rapid rates of change, there is increasing pressure to reduce the time taken to refresh such integration views. State-of-the-art incremental view maintenance literature requi...
Evaluation of Updating Methods in Building Blocks Dataset
With the increasing use of spatial data in daily life, the production of this data from diverse information sources with different precision and scales has grown widely. Generating new data requires a great deal of time and money. Therefore, one way to reduce costs is to update the old data at different scales using new data (produced on a similar scale). One approach to updating data i...
Speeding Up Incremental View Maintenance Using the Cuckoo Algorithm
A data warehouse is a repository of integrated data collected from various sources. A data warehouse can maintain data from various sources in the form of views, so each view should be maintained and updated when the sources change. Since an increase in updates may cause costly overhead, it is necessary to update views with high accuracy. Optimal Delta Evaluation method is...
Median Confidence Intervals - Grouping Data into Batches and Comparison with Other Techniques
Confidence intervals around the median of estimators are proposed as a substitute for confidence intervals around the expectation. This is adequate since for many estimators the median and the expectation are close together, or even coincide, particularly if the sample size is large. Median confidence intervals are easy to obtain; the variance of the estimator is not used. They are well suited ...
Business Restructuring as a Method of Strengtening Company’s Financial Position
Restructuring is relevant for companies that have free capital and need to expand for development purposes, as well as for companies that have relatively large problems with financial results and whose indicators show that changes are necessary. Motives for restructuring may differ; the authors put forward the following reasons: operational synergy, financial synergy, ...
Journal title:
- Inf. Syst.
Volume 32, Issue
Pages -
Publication date 2007